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Abstract 

Quantum entanglement can be used in a communication scheme to establish a correlation 
between successive channel inputs that is impossible by classical means. It is known that the 
classical capacity of quantum channels can be enhanced by such entangled encoding schemes, 
but this is not always the case. In this paper, we prove that a strong converse theorem holds 
for the classical capacity of an entanglement-breaking channel even when it is assisted by a 
classical feedback link from the receiver to the transmitter. In doing so, we identify a bound on 
the strong converse exponent, which determines the exponentially decaying rate at which the 
success probability tends to zero, for a sequence of codes with communication rate exceeding 
capacity. Proving a strong converse, along with an achievability theorem, shows that the classical 
capacity is a sharp boundary between reliable and unreliable communication regimes. One of 
the main tools in our proof is the sandwiched Renyi relative entropy. The same method of proof 
is used to derive an exponential bound on the success probability when communicating over an 
arbitrary quantum channel assisted by classical feedback, provided that the transmitter does 
not use entangled encoding schemes. 


1 Introduction 

The classical theory of communication is one of the modern successes of applied mathematics |CT9H 
IGK12| . It is arguably one of the foundations of our current information age and provides new ways 
of thinking about problems in many other fields of study, such as physics and in particular quantum 
mechanics. The interaction between these two fields is mutual; while some problems in quantum 
mechanics can be turned into communication problems, the existence of quantum phenomena 
strongly suggests that we should rethink many aspects of communication theory. Not only does 
the notion of a quantum state challenge what we mean by “information,” but the possibilities due 
to quantum mechanics give rise to new classes of communication protocols. With respect to this 
latter consideration, some fundamental motivating questions for quantum information theory have 
traditionally been and still are the following: 

1. Is the theory of classical communication affected at a fundamental level by the consideration 
of quantum mechanical phenomena? 
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2. Does using quantum states and measurement for classical communication have practical ad¬ 
vantages over using classical techniques? 

In an attempt to answer these questions, one of the primary goals is to study the ability of a 
quantum channel to communicate classical information, that is, bits. Like many communication 
problems, this ability is quantified by the notion of channel capacity. The classical capacity C 
of a quantum channel J\f is defined to be the maximum rate of communication such that the 
decoding error probability can tend to zero in the limit of many channel uses. With this definition, 
one natural question is to determine how to compute the classical capacity. We know that the 
Holevo-Schumacher-Westmoreland (HSW) theorem |Hol98[ ISW97| provides a lower bound: 

C(A/')^X(V)= sup I{X-,B)p, (1.1) 

{Px{x),px} 


where {px{x)-) Px) is an ensemble of quantum states such that each px can be input to the channel, 
and I{X]B)p = H{X)p -h H{B)p — H{XB)p is the quantum mutual information of the following 
classical-quantum state: 

PXB = YjPx{x)\x){x\x®N{px), (1.2) 

X 

where {|x)} is an orthonormal basis for the classical reference system and H (G)cr is the von Neumann 
entropy of a quantum state gq on system G. The quantity x(A/") is called the Holevo information 
of the channel. 

The classical capacity C{M) can actually be formally rewritten in terms of the Holevo infor¬ 
mation as well, via a procedure known as regularization. Before doing so, note that we obtain the 
HSW theorem by considering only encoding procedures that do not use entangled input^ That 
is, each quantum system sent through the channel is not entangled with any other system that is 
sent. To get the classical capacity, it is generally necessary to incorporate entangled inputs into 
the calculation. The approach given by |Hol98| ISW97| is to multiplex the channel such that one 
use of this multiplexed channel corresponds to multiple uses of the original channel. A multipartite 
entangled state describing several inputs across different uses of the original channel can now be 
simulated by the corresponding single input state to the multiplexed channel. We can therefore 
express the maximum rate for blocks of size n in terms of a Holevo information: 

-XIV®"). (1.3) 

n 


To obtain the classical capacity, we simply allow for inputs entangled across arbitrarily many 
channel uses: 


1 


= lim — 


n^GC n 


(1.4) 


as 


This idea put together with a converse theorem establishes the regularized expression in ( |1.4[ ) 
being equal to the classical capacity. 

Unfortunately, computing the classical capacity this way is clearly intractable. This prompts 
us to look for special cases. We observe that the limit in (1.4) is equal to x(-^) fhe Holevo 
information satisfies a tensor-power additivity property (see Appendix [A| for a brief derivation): 




(1.5) 


^The highest possible rate with this restriction, proven by the HSW theorem to be x(A’), is then a lower bound 
on the classical capacity. 
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Another way to restate the above is that entangled inputs do not increase capacity iff (m is 
satisfied. However, this is not true for some channels, as was shown in |Has09j . The next question 
then appears: What characterizes channels that satisfy S? 

A sufficient condition for a quantum channel to satisfy tensor-power additivity is for it to be 
entanglement-breaking (EB) |HSR03| . Let Ma^b denote a quantum channel, where the arrow 
notation indicates that the channel maps a state of the input system A to a state in an output 
system B. An EB channel is defined such that for any bipartite state paa'^ fhe output state 

® id A') {PAA') (1-6) 


is separable. That is, the output of an EB channel can be written as a convex sum of product 
states. Effectively, the channel “breaks” the entanglement between A and A'. Previous results have 
established that the Holevo capacity of EB channels is additiv^ |Sho02j . This is intuitive since 
any entanglement of the inputs is broken by the channel. Following this line of thinking, one could 
consider making a stronger statement by allowing additional resources to assist the communication 
but not to the point that entanglement can be established. Indeed, |BN05j proves a generalization 
of tensor-power additivity for EB channels with noiseless classical feedback. Furthermore, their 
results show that classical feedback does not increase the capacity of EB channels. 

There are also possible stronger statements in another direction. The direct part of the original 
HSW theorem states that if the rate i? of communication is less than the Holevo information y, 
then there exists a sequence of protocols such that the probability of error for such a sequence 
satisfies 

lim pe(n) = 0, (1.7) 

n^cc 

where n is the number of channel uses. A result from |Hay07| sharpens this claim by showing that 
there exists a sequence such that 

Pe(n) ^ 2-'=-, (1.8) 


for some A: > 0 determined by the channel. The converse part of the original HSW theorem can be 
strengthened in a similar manner. It states that if i? > for any sequence of protocols 7^^, 

then 


lim Pe{n) > 0. 

n^GC 


(1.9) 


This is known as the weak converse. In contrast, a strong converse is symmetric to the achievability 
result above and states that regardless of the protocols used, the success probability decreases to 
zero in the asymptotic limit whenever R > C{M). The strong converse can be sharpened as well 
whenever there is a constant separation between R and (7(A/’), such that the convergence of the 
success probability to zero is exponential in n. 

There are many reasons why we would want to prove a strong converse. First, a strong converse 
enriches our understanding of the capacity. A strong converse along with an achievability theo¬ 
rem shows that the capacity is a sharp boundary between reliable and unreliable communication 
regimes. This, amongst other results, indicates that the classical capacity of a quantum channel 
is a fundamental quantity of interest. Second, a strong converse is more relevant in practice than 
is the weak converse. A realistic quantum communication scheme has a finite blocklength; that is. 


^Shor proved that the Holevo capacity of a tensor product of an EB channel with any other channel is equal to 
the sum of their respective Holevo informations. This form of additivity is stronger than (1.5) and is the one usually 
found in the literature. 
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the encoding is across a finite number of channel uses. Although the weak converse does provide a 
lower bound on the probability of error in the non-asymptotic regime, the strong converse improves 
the bound considerably. It expresses a trade-off between rate, error probability, and blocklength 
restrictive enough to be easily checked numerically or experimentally. 

While a classical version of the strong converse is known for arbitrary discrete memoryless 
classical channels |Wol78[lAri73| . it is still open whether or not strong converses hold for memoryless 
quantum channels. After some early work |QN99|. IWin99j . it has been proved for special cases, 
in particular for channels with certain symmetry |KW09j . for EB channels [WWYMj, and for a 
wide class of quantum Gaussian channels [BGPWW15| . Given the strong converse results for EB 
channels |WWY 14) and the weak converse for EB channels with feedback |BN05j . it is natural to 
ask if these two statements are true at the same time. We can also ask directly for the strong 
converse for unentangled inputs when a feedback link is available. These are the main questions 
that we address in this paper. 

2 Summary of Results 

In this paper, we derive an explicit exponential bound on the success probability of a classical 
communication scheme that uses an entanglement-breaking channel along with classical feedback. 
The same method of proof can be used to establish a bound for arbitrary quantum channels with 
classical feedback, provided that the inputs are not entangled across multiple uses of the channel. 
When the communication rate exceeds the classical capacity, these exponential bounds immediately 
imply strong converse theorems for these settings. 

We now provide an outline of the proof: 

1. First, taking as a starting point the approach of |Nag01| , relating hypothesis testing to unas¬ 
sisted communication, we bound the success probability of an arbitrary feedback-assisted clas¬ 
sical communication protocol by a sandwiched Renyi relative entropy [MLDS^13[ IWWY14] . 

2. Next, one of the main observations from |BN05j is that the sender and receiver’s systems 
are separable at all times throughout such a protocol, whenever the communication channel 
is entanglement-breaking. We use this fact and an entropy inequality from |Kin03j to split 
the relative entropy into two terms. The first term is bounded by an cr-informat ion radius, 
which is a measure of the range of states a channel can output. We then equate this to the 
sandwiched cf-Holevo information [WWYMj, which is a Renyi generalization of the Holevo 
information. The second term is bounded via monotonicity by another sandwiched Renyi 
relative entropy, to which we recursively apply the same argument. 

3. This gives the following bound on the probability of success for any finite blocklength n\ 

Psucc ^ (2.1) 

where R is the rate of communication and Af is the EB channel. It follows from previous 
arguments |WWY 14j that when i? > vfAG. the rierht hand side of is a decaying expo¬ 
nential, thereby establishing the strong converse. We provide an alternate (arguably simpler) 
proof of this fact (similar to those in [MHlll ICMWlGj i by establishing that the a-Holevo 
information converges continuously to the Holevo information as a approaches 1. 

Appendix includes a brief review of the argument for the weak converse from |BN05| . 
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3 Preliminaries 


In this section we provide some necessary definitions, concepts, and previous results used in the 
derivation of 0. 

3.1 Quantum states, measurements, operator norms, and quantum channels 

We start with definitions of relevant mathematical notions from quantum mechanics. Given a finite¬ 
dimensional Hilbert space 1-L^ let B{l-L) denote the algebra consisting of bounded linear operators 
acting onl-L. A relevant measure of a bounded operator X is its Schatten a-norm^ which is defined 
as 

||X|U = {Tr[|Xn}'/“, (3.1) 

where a ^ 1 and |X| = VxtX. 

Now, let T{T-L) Q B{T-L) denote the subset of trace-class operators. The set of quantum states is 
a convex subset of T(T-L) given by 

Sin) = {pe Tin) : pt = p, p ^ 0, Tr p = 1 } , (3.2) 

where the notation p ^ 0 means that p is positive semidefinite. For composite states, we consider 
the tensor product of two Hilbert spaces T-La ^^nd 1-Lb^ denoted by 1-La ®1~Lb- We can obtain from 
the overall density operator pab ^ S{1-La ®1~Lb) the reduced density operator pA corresponding to 
the quantum state on only the A system by performing a partial trace: 

Pa = "^"^b{pab) = G) 1 ^) 5 ) (3.3) 

b 

where {|&)^} is an orthonormal basis of states on the B system. A state pab is separable if it can 
be written as 

PAB = ® Pb^ (3-4) 

X 

where p{x) is a probability distribution and {p\] and {p^} are sets of states. 

A positive operator-valued measure (POVM) consists of a set {A^} of positive semidefinite 
operators indexed by m and corresponding to different measurement results. The set satisfies 
Am = which allows us to interpret the quantity 


Pm — 


(3.5) 


as the probability of measuring m given a quantum state p. 

We next consider maps on trace class operators and in particular quantum states. A linear 
map T : T{1-La) ^ T{1-Lb) is called positive if Vr g T{1-La)’> t ^ 0 implies ^^(t) ^ 0. It is called 
eompletely positive if idji (x) T is positive for an arbitrary auxiliary system i?, where idji is the 
identity map on R. A map T is a quantum ehannel if it is linear, completely positive, and also 
trace-preserving. 
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Figure 1: A general protocol for feedback-assisted classical communication. The sender manipulates 
the systems labeled by A and the receiver those labeled by B. Every X system is classical and 
represents noiseless classical feedback from the receiver to the sender. 


3.2 Classical feedback-assisted classical communication protocols 


Figure [^depicts the most general three-round protocol for classical feedback-assisted classical com¬ 
munication. The generalization to n rounds is clear. The protocol begins with Alice preparing a 
classical register M with the message to be sent. This is correlated with some system Aq. Bob uses 
the classical feedback channel to send a classical system Xq correlated with some quantum system 
Bq to Alice. The global state is then 

Pma'^XqB'^ = ® . ( 3 . 6 ) 

m X 

We will track the state conditioned on a particular value m of the message register M, given by 

Pa'qXqB'q — Pa'q k)(®lxo ® PB'g- (^•'^) 

X 

Alice performs an encoding A' Ai' state at this point is 

Pa^AiB'q - {/a'^XoB'^ ( 3 - 8 ) 

= (pu' ® kXa^lxo) ® Pb'’ (3-9) 


We note that the state p^f separable with respect to the cut A[Ai : Bq. 

Next, Alice uses the EB channel A/ai^Si for the first time by sending system Ai, leading to 
the state 

P^[BiB'q = -^Ai^Bi (^Pa[AiB'q) • ( 3 . 10 ) 

Since the channel is entanglement-breaking and the state before the channel was already separable 
with respect to the A'^Ai : Bq cut, the state after the channel is fully separable. Now, consider 
instead that Alice uses an arbitrary channel but with separable inputs. Then, the state 

BiB' SigAm fully separable. We write it in the form 


no 

PA{BiB'g 


Yp^^y^pA[ 




y,m 


y 


( 3 . 11 ) 
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Bob now applies the decoding map ^XiB' ’ where Xi is the classical system that is sent 

back to Alice. The state at this point is 

Pa[XiB[ = {pA[BiB'q) 

XI Zl 

which is fully separable. Hence, after Alice applies the second encoder and then sends it through 
the channel, the state will still be fully separable. The key observation here is that if we use an EB 
channel or an arbitrary channel with separable inputs, the state is always separable aeross a eut 
that divides Aliee and Bob’s systems. 

The only difference in subsequent rounds is the final measurement. Say there are n rounds in 
the protocol. At the last round. Bob measures the state Pb^b' ^ using a POVM given by {D'^} 
with elements corresponding to different possible messages that Alice sent. 


(3.12) 

(3.13) 


3.3 Renyi relative entropies and bounds on success probability 

An important classical information theoretic quantity is the Renyi relative entropy, which can be 
generalized to the quantum case in a number of ways. In this paper, we use the sandwiched quantum 
Renyi relative entropy |MLDS~*~13l I WWY 14] which is given by 


Da{p\W) = 



: p A (j A (supp(p) ^ supp((j) V (a G (0,1)) 
: otherwise 


(3.14) 

where a g (0,1) u (l,oo) and p A a means p, a are non-orthogonal quantum states. Note that all 
logarithms in this paper are taken base two. 

We now recall some properties of the sandwiched Renyi relative entropy. For fixed p and a, the 
function a ^ Da{p\\a) is monotone non-decreasing |MLDS^13 
relative entropy D{p\\a) |Ume62| in the limit as a 


It also converges to the quantum 


1 |MLDS+13[ IWWYMj : 


lini Da{p\\cr) = D{p\\a), 


(3.15) 


where 


D{p\W) 


Tr [p (log p - log ct)] : supp(p) e supp(c7) 
-hoo : otherwise 


(3.16) 


Furthermore, it satisfies the data-processing inequality for a e [1/2,1) u (1, oo) |FL13[ lBeil3j : that 
is, for all quantum channels V, 


DaX{p)\W{V) ^ DMa). 


(3.17) 


In particular, consider the following replacement channel which simply replaces the input with some 
state uj: 

77c^(p) = Tr(p)n;. (3.18) 

It is easy to see that for all ca, Da{co\\Lo) = 0, implying that 

Da{p\\a) > DainUpWM) = 0, (3.19) 
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which shows that the relative entropy is non-negative whenever its arguments are quantum states 
p and a. 

Using the relative entropy, we can define the sandwiched a-Holevo information |WWY 14) of an 
ensemble, that is, a classical probability distribution of quantum states, {px{x)^px} as 

Xa{{px{x), px}) = inf DaipxnWpx ® o-r) (3.20) 

aRESinn) 

where 

pxi? = y]px(a;)|a;><a;|x ® (3.21) 

X 

and px are states of a system R. With this, we define the a-Holevo information of a quantum 
channel N as 

XaX)= sup Xoc{{vx{x)X{Px)]) ■ (3.22) 

{Px{x),px} 

We also define the ^-information radius of a channel M as 

KaX) = inf sup Da{M{p)\\o). (3.23) 

creSiW) peS(H) 

We note that since is monotonically non-decreasing in cr, so are and 


4 Strong converse 


This section is dedicated to proving the main theorem of this paper. We will need the following 
lemmas. The first is proven in |Nag01| and in Lemma 5 of |WWY 14] via monotonicity of Da- 
The second will be used to take advantage of the separability of the quantum state observed in 
Section The third is an equality between ct-Holevo information and ct-information radius. The 


fourth shows that a-Holevo information and ^-information radius tends to the conventional Holevo 
information and information radius in the limit a ^ 1. Note that this establishes Lemma [3] as a 
generalization of the equality x(A/’) = K{J\f) |QPW97l ISWOli rSW02| . 


Lemma 1. Let a > 1, p^a e S{T-L), and A be sueh that 0 ^ A ^ Let 


p = Tr [Ap] , q = Tt [Act] . (4.1) 

Then 

5„(p||<7)^^log[pV“]. (4.2) 

a — 1 

Lemma 2 r [Kin03( IHol06] L Let Pab be a positive semidefinite separable operator. Sueh an 
operator ean be written in the following form: 

PAB = L<^{E)Di, (4.3) 

3 

where ^ 0 for all j. Let Pb = Tr^ {Pab] ciRd let AAa be a eompletely positive linear map 

aeting on the A system. Then, for all a ^ 1, 


||(A4A0idB) (Pab)IIq, ^ I'aiMA) ■ ||.PB|la> 


( 4 . 4 ) 


















where VaiJ^A) is the 1 ^ a norm of Ma, defined as 


i^a{MA) = sup 

x^o,XeT{'HA) 


\\Ma{X)\1 

IIXIL 


(4.5) 


Lemma 3 (jWWYIlj). For a > 1, the a-Holevo information and the a-information radius are 
the sam^ 

Xa{M) = K^{M). (4.6) 


Lemma 4. For a quantum ehannel N, the following limits hold: 


and 


where 


lim Xa(A/') = xi-X) 

OL^l 


lim Ke,{N) = 
a^l 


K(M) = inf sup D{M(p)\\a) 
aeS{H) peS{n) 


(4.7) 

(4.8) 

(4.9) 


is the information 
We now state the theorem. 

Theorem 5. Given any n-round protoeol for elassieal feedbaek-assisted elassieal eommunieation 
over an entanglement-breaking ehannel J\f with rate R, the average probability of sueeess is bounded 
from above by an exponential in n: 


Psucc ^ 


(4.10) 


where Xq, is the a-Holevo information. The same bound holds for an arbitrary ehannel J\f given 
that the eneoder does not entangle inputs aeross different uses of the ehannel. 

Proof. We take as a starting point the approach of Nagaoka |Nag01| , connecting hypothesis testing 
with data processing of a Renyi information quantity. Let Vn be such a protocol. We are bounding 
the average probability of success, so we assume Alice chooses her messages uniformly at random. 
Using the notation of Section [3^ the state we have at the final round of the protocol Vn is 

L 


PMBnB'_, - 


Vi = 7 E 


m—l 


mM ® 


(4.11) 


where L is the number of possible messages. 

Following the argument in [CMW16] , we can write the success probability as 


PsMCC 




nm m 

^Bn B' , PBn B' 1 
^ n—1 ^ n—1 


= Tr[l 




(4.12) 


^To get all a > 1, we actually need to extend the lemma in [WWY14] slightly. This follows from the proof 
given there and the observations that for all a > 1, is operator convex, Tr{x^} is convex, and Da is jointly 

quasi-convex. 

^This lemma is proved in Appendix 
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Figure 2: A protocol for feedback-assisted classical communication when using the replacement 
channel. Notice that the communication line between Ai and M is broken. 


where 

— X! I^X^Im (8)(4.13) 

m 

Note that 0 ^ T ^ so that {T, / — T} is a POVM. 

We now consider the state Tj^^nB' ^ defined to be the final state if we had implemented Vn 
using a replacement channel TZa- instead of the original channel (see Figure]^. The overall state in 
this alternate scenario has a simple expression since Bob’s states are now independent of m: 






where tm = lujL. This allows us to compute the following: 


Tr T; 


MBnB'^_^^MBnB'^_ 




1 

Z’ 


(4.14) 


(4.15) 


where the last equality follows because ^ Tr[crB^ J = 1. This equality is 

intuitive, since for a replacement channel, the receiver Bob cannot do any better than to guess the 
input message m at random. Letting T be the operator A in Lemma [T] we conclude that for a > 1, 


which can be re-written as follows: 

(X ~ 

-7 log(T SUCC ) + logL ^ DaipMBnB' 

<X — 1 


Psncc ’ 


(4.16) 


(4.17) 


We would therefore like to bound Da{PMB„B' ^ \\tmb„b' j)- To do so, consider that 


Da{PMB„ 




- Da{MAn^Bn{PMAnB'^_^)WMB'^_^ ®(^B„) 


a 

a — 1 


log 



(l-a)/(2a) 
MB' 


Pma^b'^_Jmb'„ 


(l-a)/(2a)' 


(4.18) 


10 

































where in the last equality, we define 0 by 


&a(p) = 


(4.19) 


and the identity operation on the other systems is implied. We now use the key observation from 


Section 3.2 (and used in [BNOBj i that if JV is EB or if Alice uses separable inputs, throughout Pn, 
Alice and Bob’s systems are always separable. Furthermore, the M system is classical, so Pma„b' ^ 
is separable with respect to the An : cut. This implies that 


(l-a)/(2a) (l-a)/(2a) 


is a positive semidefinite separable operator: 


(l-a)/(2a) (l-a)/(2a) (l-a)/(2a) t , -si „ J 


(1-q)/(2q) 
MB' , 




.(l-a)/(2a) J 


dl-«)/(2a) 




(4.20) 

(4.21) 

(4.22) 


Since conjugation by a positive semidefinite operator is clearly a completely positive map, we can 
apply Lemma to conclude that 


I \r \ f (l-c^)/(2ci) (l-ci)/( 2 ci) 

I °-^An^Bn j yMB'^_^ P^^nB'^_jMB'^_^ 


^ 1 0 oMa 


n 


(l-a)/(2a) _ 


(l-a)/(2a) 




(4.23) 


We then have the following chain of inequalities: 

-Sa(PMS„S' , IkMBnB' , ) ^ -7 log t'a j 0 ° Af ) +-Sa(pMB' , I^MB' , ) 

ni Q/ — i J n 1 n i 

^ -vlogr'aj 0 oJ\f j + Da{pMBn-iB'„ J'T'MB, 

a-1 y aJ 

^n—^logl/a 0 oA/" + DaipMB'JrMB'J 
a —I \ (T„“ / 0 0 




a 


= n 


log VoL © o M 
a-1 \ fT - 


a 


= n- 


cr — 1 


log sup 

Pa^B{'Ha) 


( 0 oJVa^b ) (pa) 
\ / 


(4.24) 

(4.25) 

(4.26) 

(4.27) 

(4.28) 


The first inequality follows by combining (4.18) and (4.23). The second inequality follows from 


monotonicity of the sandwiched Renyi relative entropy under the partial trace channel. The third 
inequality follows by recognizing that Da{pMB„-iB' 2 ^ relative entropy at round 

n — 1 of Pn, which allows us to apply the above argument inductively. This is a crucial step of the 
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argument and proves a form of additivity similar to that of (m. The first equality is a consequence 
of the fact that Pmb'^ ^ since no channels have been applied at that point in the protocol. 

The last equality follows from a result from |Aud09j which allows us to take the supremum over 
quantum states instead of all trace class operators (furthermore from the fact that quantum states 
have trace equal to one). Hence, we have that 


(y /^ \ 1 T 

logPsucc('Pn) + log L ^ n -^ log sup 


a — 1 


a — 1 




© oMa-^B ) (pa) 


(4.29) 


Since this is true for all states cts, we can conclude that 


o. 

logpsucciVn)+ logL inf sup --log 

aseSiUB) p^eSiUA) ~ ^ 

= nKa{Af) 

= nXoc{M). 


( 0 l-a O Ma^B 

V 



(4.30) 

(4.31) 

(4.32) 


The first equality comes from the definition of the ^-information radius, and the second equality 
is due to Lemma m 

Now, the protocol uses the channel n times and the rate R of Vn is defined to be the number 
of bits per channel use, so that R ^ This allows us to introduce R into the inequality: 

1 a - 1 

- logPsucc(7^n) ^- {R - Xa(Ar)). (4.33) 

n a 


Since this is true for all a > 1, we can take a supremum over a > 1 and arrive at the bound stated 
in CTol). □ 


The strong converse itself now follows from Theorem and Lemma 

Corollary 6 (Strong Converse). The probability of success of any sequence of protocols which 
use an entanglement-breaking channel with classical feedback at a rate greater than the classical 
capacity is bounded from above by a decaying exponential. The same is true for arbitrary channels 
with separable inputs. 

Proof. Recall that since is monotonically increasing in cr, so is Xc^. This along with Lemma 
implies 


inf Xa(A/') = lini 
a>l a\l 

= x(-V). 


(4.34) 

(4.35) 


Hence, if i? > by the continuity of Xa{J^) as a \ 1, there exists a value of cr > 1 such 

that (i? — Xa(A/")) > 0. Then the bound in ( |4.10D implies an exponential decay of the success 
probability. □ 
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5 Conclusion 


By studying the classical capacity of a quantum channel, quantum information theorists found 
that an implication of quantum mechanics for classical communication is the possibility of superior 
encoding schemes that use entanglement, a uniquely quantum phenomenon |Has09j . However, en¬ 
tanglement does not give an advantage for every channel. The results in this and previous papers 
show that for entanglement-breaking channels, no communication protocols, even with classical 
feedback, can take advantage of this extra resource. They cannot use it to increase their capac¬ 
ity, nor, as proved in this paper, to even lift the exponentially decaying ceiling on their success 
probabilities. We perhaps expect this since by definition EB channels destroy entanglement, and 
classical feedback channels cannot create entanglement. This is the guiding principle behind our 
proof, in particular the key fact that the transmitter and receiver states are separable throughout 
the protocol. 

More generally, it is known that classical feedback can give a large boost to the classical capacity 
of channels which are not entanglement-breaking in at least two different ways: first, there exist 
channels for which the single-copy Holevo information is small but the single-copy Holevo informa¬ 
tion with feedback included can be quite large |BDSS06] . Similarly, there exist channels for which 
the classical capacity is small but becomes large when classical feedback is available |SS09| . In light 
of these prior results, this paper in some sense gives the strongest result that we could expect: going 
outside of the class of entanglement-breaking channels gives both nonadditivity and big gains from 
feedback. Thus, showing that the Holevo information is a strong-converse bound suggests that for 
feedback not to help, we require special channels, such as the entanglement-breaking ones. 

Going in the opposite direction, it is known that EB channels form a proper superset of classical 
channels |KHH12j . We thus obtain the result of |PV10j as a direct corollary. Eurthermore, since the 
original posting of our paper as arXiv: 1506.02228, an open question that we posed has now been 
answered — the strong converse exponent from Theorem]^ is tight, due to the results in |MQ15| . 
Hence, we obtain as a special case from this and our result the classical results of |Aug781 Id™ 
ICK82| as well. 

A possible direction for future research is to ask the same question for Hadamard channels, 
which are defined to be complements of EB channels. To define the complementary channel, we 
first explain the interpretation of channel as a model for open quantum dynamics. That is, a 
channel from system A to system B can be interpreted as the restriction of a unitary interaction 
with larger system BB, where B is referred to as the “environment.” The complementary channel 
is the map from A to BB tracing out the B system instead and is itself a channel from A to B. 
Hence, Hadamard channels break any entanglement with the environment system that is traced 
out and is thus related to our guiding principle. The strong converse has already been proved for 
Hadamard channels |WWY 14) . but with the addition of classical feedback, even a weak converse 
has yet to be proved. Examples of Hadamard channels include generalized dephasing channels, 
cloning channels, and the Unruh channel |BHTW10] . 

Einally, we remark here that it should be possible to use the methods given here and in |CMW16] 
in order to characterize a particular adaptive hypothesis testing scenario. Suppose that the goal is 
to distinguish an entanglement-breaking channel from a replacement channel by means of adaptive, 
separability-preserving channels. Then the optimal strong converse exponent should be given in 


terms of a quantity similar to that in (4.10). However, we leave the details for future work. 
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A Additivity 


Here we justify the statement in (1.5). The backward implication is trivial. As for the forward 
implication, we first note that given n, for all k e 


Aw®”) ^ 


n 


(A.l) 


because we can split the kn channels into k blocks of n channels and encode each block inde¬ 
pendently. In particular, ^ To show the opposite inequality, we assume the 

contrapositive: > x(*^) some n. However, this means 


- XW) ^ Aw®”) - XW) > 0, 


n 


for all k G and thus the limit in (1.4) does not converge to x(*^)- 


(A.2) 


B Weak converse and finite bounds 

We recall for motivation and the reader’s convenience the argument for the weak converse from 

[BNO^ . 

Theorem 7. Let Vn be a protoeol for elassieal feedhaek-assisted elassieal eommunieation over an 
entanglement-breaking ehannel J\f sueh that it uses the ehannel n times, has eommunieation rate 
R, and has average probability of deeoding error e. Then, it satisfies the following inequality: 

+ 9 {n,e), (B.l) 

where xU^) i^b.e Holevo information of the ehannel and g{n,e) is a real valued funetion sueh that 

lim lim g(n,e) = 0. (B.2) 

e\0n^(X)^^ ^ ^ 

The same is true for a protoeol for eommunieation over an arbitrary ehannel given that the eneoder 
does not entangle inputs aeross different uses of the ehannel. 

Proof. We use the notation in Section [3^ and take a general approach that is, for instance, presented 
in Section 19.3.2 of |Willl| . Let $ denote the following “shared randomness” state: 

IWWIm> (B-3) 
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where \M\ is the size of the message set Ai. Now, suppose the information processing task is 
common randomness generation instead of classical communication. If we require the state p at 
the end of the protocol to be 6:'-close to $ 


11$ - Pill (B.4) 

the Fannes-Audenaert inequality for mutual information |Fan73[ IAud07| gives 

nR = I{M]M)^ (B.5) 

+ (B.6) 

where /(n, e') is some continuous function of n and e' with the property: lim^/x^o lin^n^oo ^ 

0. Continuing, we have 

(B.7) 

= /(M; + /(M; (B.8) 


where we applied the data processing inequality and the chain rule for conditional quantum mutual 
information. We use the chain rule again to get 

^ (B.9) 

From Section [3^ if J\f is entanglement-breaking or if Alice uses separable inputs, Alice and Bob’s 
systems are always separable. Hence, the global state before the nth channel use can be written as 

m 

= YjPm (m) 

my 

Hence, the state after the channel is given by 

PMA'„BnB'^_, = h}(mjj^^®J^PYiM(ylm)J\fA^^B,r(^PA'fAn) ® Pb^_P 

my 

We introduce an auxiliary system Y that labels the separable sum over the index y. That is, Y is 
chosen such that the global state is 

PMYA'^BnB'^_^ = E PM {m)pY\M{y\fn) \'m){m\^ 0 \y){y\Y ®YAn^Bn {/a'^a^ ®PbY 

m,y 


We trace over to get 

PMYBnB'^_, = YiP^^^^Py\^^y^^^^'^Ym\M®\y)(y\Y®PBn 


m,y 


m,y 
f' 1 ’ 

n—1 


where 


f . TV.4, (pXf) ■ 


m,^ _ 

Pbu = 


(B.14) 


(B.15) 
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This allows us to argue 


I ^ I {MYBYi; Bn)^ 

(B.16) 

= IiMY-,Bn)^ + l{BYi,Bn\MY)^ 

(B.17) 

= nMY-,Bn)p 

(B.18) 

^ x(V), 

(B.19) 


where the first inequality is from data processing, the first equality from the chain rule, and the 
second equality because the system is in a product state when conditioning on M and Y. 

Finally, given equation (B.15), the state PmyBu is a classical-quantum state of the form: 


PMYBn - {pmyb,,b'^_^ (B.20) 

= Y^Pm im)pY\Miy\m) |y><y|y {pa^) ■ (B.21) 

m,y 


Hence, the definition of the Holevo information of the channel Af gives us final inequality. Putting 
things together, we find that 




(B.22) 

(B.23) 


where the last inequality follows from the data processing inequality. But now, we recognize the 
quantity / (M; Bn-iB'^_ 2 )^ is of the same form as /(M; BnB'^_i)p in (B.7), so that we can iterate 
through the same sequence of arguments to get 


I(M; M)p ^ 2x(V) + I (M; 3^-23^^)^ ■ 
Continuing all the way back to the first channel use, we find 

I{M-,M)p^nx{N) 


since I{M]Bq) = 0 (see (3.6)). Thus, we conclude 


R ^ X(-V) + -f{n,s'). 
n 


(B.24) 


(B.25) 


(B.26) 


Now, the rate for classical communication with n channel uses and error is at most that of 
randomness generation with n uses and error Y^ where Y is some function of 6: such that lim^x^o = 
0. Hence, g{n^e) = is what we need. □ 


Corollary 8 (Weak Converse). The classical capacity of entanglement-breaking channels with 
classical feedback is given by the Holevo information. The same is true for arbitrary channels with 
separable inputs. 


Proof. This is immediate. 


□ 
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C Limit value of the a-Holevo information 


The arguments here are essentially the same as those in |MH 11 [ fCM W16 j . along with an additional 
insight from |TWW14t Appendix A]. We first recall a minimax result due to |MH1H Corollary A2]. 


Lemma 9. Suppose a function f : X x Y M., where X is a compact topological space, Y is a 
subset of W, and M is the extended real numbers, satisfies 

1. \/y E Y, f{'^y) is lower semicontinuous. 

2. Mx E X, f{x, •) is monotonic. 

Then, 

inf sup f{x,y) = sup inf f{x,y). (C.l) 

xeX y^Y yeY 

If the first condition was instead 

1. My E Y, fi'^y) is upper semicontinuous, 
then 

inf sup f{x, y) = sup inf f{x, y). (C.2) 

xeX xeX 


We now prove Lemma 

Proof For fixed pxR' si^nd a, gri ^ Dot{pxR'^ Px®(^R') is lower semicontinuous (see, e.g., |CMW16] 
Appendix A]). Furthermore, for fixed pxR' and (Jr/, a ^ Da{pxR'^ Px ® o-r/) is monotone non¬ 
decreasing. We can therefore invoke Lemmawith X = S{1-Lri) and Y = (0,1). We use this, the 
definition of the Holevo information, and properties of the quantum relative entropy to find the 


following one-sided limit of the cr-Holevo information of a quantum channel N : 

: RUr) - nn^)-. 

lim Xoc{X) = sup Xoc{X) 
ae(0,i) 

(C.3) 

= sup sup Xa{{Px{x),M{Px)}) 

ae(0,l) {px{x),px} 

(C.4) 

= sup sup inf Da{pxR'\\px®(^R') 

{px{x),px} Q;e(0,l) 

(C.5) 

= sup inf sup Da{PxR'\\px®crR') 

{px{x),px} ^R'^^V^R') ae(0,l) 

(C.6) 

= sup inf D{pxR'\\px®CFR') 

{Px{x),px} ^R'^^y^R') 

(C.7) 

= sup I{X : R')p 

{Px{x),Px} 

(C.8) 

= x(-V), 

(C.9) 

where 


PXR' = TiPx{x)\x)(x\x ® [X{Px)\ri ■ 

(C.IO) 


X 
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The fourth equality follows from applying Lemma|^ The fifth follows from (3.15) and the fact that 
the sandwiched Renyi relative entropy is monotone non-decreasing in a. 

To find the other limit, we note that for fixed pR and aR/^ a ^ Da{J\f{pR)\\crR') is monotone 
non-decreasing. We then use an idea from |TWW14l Appendix A]. Note that for fixed a and 
positive definite aRf^ pR ^ Da{J\f{pR)\\crR/) is continuous (and thus upper semicontinuous). Let 
s G (0,1) and define cr (s) = {1 — s) a + gtt, where tt is the maximally mixed state. Thus a{e) is 
positive definite for a positive semidefinite. Consider that 


cr(e) ^ (1 - e) cr, 


(C.ll) 


which implies that 


D{p\\aie)) ^ D{p\\ (1 - s) a) = D{p\\a) - log (1 - e). (C.12) 

(This is because of the well known fact that a ^ a' implies that D{p\\a) ^ D{p\\a').) Hence, using 
Lemma Lemma and the identification of the Holevo information as an information radius 
|OPW97t ISWOlj . we can conclude the following chain of equalities: 


lim XaW) 

= inf XaW) 

(C.13) 

a\l 

a>l 


= inf K^{M) 

OL>l 

(C.14) 


= inf inf sup Da{M{pR)\\(jR,) 

(C.15) 


^ inf inf sup {Af {pR)\\cr{e)R>) 

a>l pj^eSinn) 

(C.16) 


= inf inf sup Da {Af ipR)\\a{s)R/) 

a>l pj^^SiHR) 

(C.17) 


= inf sup inf Da {J\fipR)\\a{e)R') 

(7R,eS{nR,) pReS(HR) 

(C.18) 


D{M{pR)\\a{e)R>) 

crR,eS{HR,) preSCHr) 

(C.19) 


^ inf sup D{J\f{pR)\\aR')-\og{l - e) 

^Pr^S{'Hr) 

(C.20) 


= X(-V) - log (1 - e). 

(C.21) 


The first inequality follows because there is an infimum with respect to aR/^ such that replacing aRf 
with a{e)Rf can never decrease the quantity. The fifth equality follows from applying Lemma 
The sixth equality follows from (3.15) and the fact that the sandwiched Renyi relative entropy is 
monotone non-decreasing in a. The last inequality follows from (C.12). The last equality uses 
X{M) = K{Af). 

Given that g > 0 was arbitrary, we can conclude that limQ,\^i Xq.(A/’) ^ x(*^)- Combined with 
the fact that ^ x{J^) all a ^ 1, we conclude that 


This also implies that 


lini XaW) = x(-V). 

a\l 

(C.22) 

lim Ka{X) = K{M). 

a\l 

(C.23) 
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We now show that limayi Ka{J\f) = which follows from a similar line of reasoning: 


lim Ka{M) = sup Ka{M) 

"/■I a6(0,l) 

(C.24) 

= sup inf sup Da{N{pR)\\aRi) 

Q:G(0,1) PR^^iP'R) 

(C.25) 

= inf sup sup ba{N{pR)\\aRi) 

aR,eS(HR,) Q;e(o,i) pReS(nR) 

(C.26) 

= inf snp sup ba{N{pR)\\aRi) 

aRieS{HRi) pReS(UR) ae(0,l) 

(C.27) 

^ D{-N'{pR)\\aRi) 

'^r'^S{Hr,) pReSiHR) 

(C.28) 

= K{M), 

(C.29) 


where the third equality follows from Lemma and the fact that for fixed a and gr/ ^ 
Da{’N'{pR)\(^R') is lower semicontinuous and that the pointwise supremum of lower semicontinuous 
functions is lower semicontinuous. □ 
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